Spectral clustering is amongst the most popular methods for communitydetection in graphs. A key step in spectral clustering algorithms is theeigen-decomposition of the $n{\times}n$ graph Laplacian matrix to extract its$k$ leading eigenvectors, where $k$ is the desired number of clusters among $n$objects. This is prohibitively complex to implement for very large datasets.However, it has recently been shown that it is possible to bypass theeigen-decomposition by computing an approximate spectral embedding throughgraph filtering of random signals. In this paper, we prove that spectralclustering performed via graph filtering can still recover the planted clustersconsistently, under mild conditions. We analyse the effects of sparsity,dimensionality and filter approximation error on the consistency of thealgorithm.
展开▼
机译:谱聚类是最流行的图形社区检测方法之一。频谱聚类算法的关键步骤是对$ n {\ times} n $图拉普拉斯矩阵进行特征分解,以提取其$ k $个前导特征向量,其中$ k $是$ n $个对象中所需的簇数。对于非常大的数据集而言,这实现起来极其复杂。但是,最近发现,通过计算随机信号的图形频谱嵌入近似值,可以绕过特征分解。在本文中,我们证明了通过图滤波进行的光谱聚类仍然可以在温和条件下一致地恢复种植的簇。我们分析了稀疏性,维数和滤波器近似误差对算法一致性的影响。
展开▼